A Quantiication of Distance-bias between Evaluation Metrics in Classiication
نویسندگان
چکیده
This paper provides a characterization of bias for evaluation metrics in classiication (e.g., Information Gain, Gini, 2 , etc.). Our characterization provides a uniform representation for all traditional evaluation metrics. Such representation leads naturally to a measure for the distance between the bias of two evaluation metrics. We give a practical value to our measure by observing if the distance between the bias of two evaluation metrics correlates with diierences in predictive accuracy when we compare two versions of the same learning algorithm that diier in the evaluation metric only. Experiments on real-world domains show how the expectations on accuracy diierences generated by the distance-bias measure correlate with actual diierences when the learning algorithm is simple (e.g., search for the best single-feature or the best single-rule). The correlation, however, weakens with more complex algorithms (e.g., learning decision trees). Our results show how interaction among learning components is a key factor to understand learning performance.
منابع مشابه
Evaluation Metrics in Classiication: a Quantiication of Distance-bias
This paper provides a characterization of bias for evaluation metrics in classiica-tion (e.g., Information Gain, Gini, 2 , etc.). Our characterization provides a uniform representation for all traditional evaluation metrics. Such representation leads naturally to a measure for the distance between the bias of two evaluation metrics. We give a practical value to our measure by observing if the d...
متن کاملEvaluation Metrics in Classification: A Quantification of Distance-Bias
This paper provides a characterization of bias for evaluation metrics in classification (e.g., Information Gain, Gini, χ, etc.). Our characterization provides a uniform representation for all traditional evaluation metrics. Such representation leads naturally to a measure for the distance between the bias of two evaluation metrics. We give a practical value to our measure by observing the dista...
متن کاملReview of ranked-based and unranked-based metrics for determining the effectiveness of search engines
Purpose: Traditionally, there have many metrics for evaluating the search engine, nevertheless various researchers’ proposed new metrics in recent years. Aware of this new metrics is essential to conduct research on evaluation of the search engine field. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating the search engines. Methodology: This is ...
متن کاملSubmitted to CVPR ' 99 Discriminant Analysis based Feature ExtractionW
We propose a new feature extraction scheme called Discriminant Component Analysis. The new scheme decomposes a signal into orthonormal bases such that for each base there is an eigenvalue representing the discriminatory power of projection in that direction. The bases and eigenvalues are obtained based on certain classiication criterion. For simplicity, a criterion used in Fisher's Discriminant...
متن کاملHanding the Microphone to Women: Changes in Gender Representation in Editorial Contributions Across Medical and Health Journals 2008-2018
The editorial materials in top medical and public health journals are opportunities for experts to offer thoughts that might influence the trajectory of the field. To date, while some studies have examined gender bias in the publication of editorial materials in medical journals, none have studied public health journals. In this perspective, we studied the gender ratio ...
متن کامل